Adversarial Blocking Bandits
We consider a general adversarial multi-armed blocking bandit setting in which each played arm can be blocked (made unavailable) for some number of time periods, and the reward of each arm is set adversarially at each time period, without obeying any distribution. The setting models the allocation of scarce, reusable supplies (modelled as arms) that replenish and can be used again only after certain time periods. We first show that, in the optimization setting, when the blocking durations and rewards are known in advance, finding an optimal policy (i.e., which arm to play in each round) that maximises the cumulative reward is strongly NP-hard, ruling out a fully polynomial-time approximation scheme (FPTAS) for the problem unless P = NP. To complement this result, we show that a greedy algorithm that plays the best available arm at each round provides an approximation guarantee that depends on the blocking durations and the path variation of the rewards. In the bandit setting, when the blocking durations and rewards are not known, we design two algorithms, RGA and RGA-META, for the case of bounded blocking durations and bounded path variation.
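As a concrete illustration of the greedy policy described in the abstract, here is a minimal sketch for the offline (optimization) setting, where per-round rewards and blocking durations are known in advance. All function and variable names are illustrative assumptions, not taken from the paper.

```python
# Greedy policy for the offline blocking bandit problem: at each round,
# play the available arm with the highest reward, then block it for its
# given duration. A simplified sketch; rewards and durations are assumed
# known in advance (the optimization setting, not the bandit setting).

def greedy_blocking(rewards, durations):
    """rewards[t][k]: reward of arm k at round t (set adversarially).
    durations[t][k]: rounds arm k stays blocked if played at round t.
    Returns the total reward collected and the sequence of arms played."""
    T = len(rewards)
    K = len(rewards[0])
    free_at = [0] * K          # first round at which each arm is available again
    total, plays = 0.0, []
    for t in range(T):
        available = [k for k in range(K) if free_at[k] <= t]
        if not available:
            plays.append(None)  # every arm is blocked this round
            continue
        k = max(available, key=lambda a: rewards[t][a])
        total += rewards[t][k]
        free_at[k] = t + durations[t][k]  # arm k blocked until this round
        plays.append(k)
    return total, plays
```

When every duration equals 1, this reduces to playing the per-round argmax; longer blocking durations are what couple the rounds together and make the offline problem strongly NP-hard.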
Efficient Restarts in Non-Stationary Model-Free Reinforcement Learning
Hiroshi Nonaka, Simon Ambrozak, Sofia R. Miskala-Dinc, Amedeo Ercole, Aviva Prins
In this work, we propose three efficient restart paradigms for model-free non-stationary reinforcement learning (RL). We identify two core issues with the restart design of the RestartQ-UCB algorithm of Mao et al. (2022): (1) complete forgetting, where all information learned about the environment is lost after a restart, and (2) scheduled restarts, where restarts occur only at predefined times, regardless of how incompatible the current policy is with the environment dynamics. We introduce three approaches, which we call partial, adaptive, and selective restarts, to modify the algorithms RestartQ-UCB and RANDOMIZEDQ (Wang et al., 2025). We observe near-optimal empirical performance across multiple environments, decreasing dynamic regret by up to 91% relative to RestartQ-UCB.
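To make the "complete forgetting" issue concrete, the sketch below contrasts a complete (forget-everything) restart with a partial restart that retains a fraction of the learned Q-values. This is an assumed, simplified illustration of the idea only: it omits the confidence bonuses and epoch schedules of the actual algorithms, and every parameter name here is hypothetical rather than from the paper's code.

```python
import numpy as np

# Contrast between a complete restart (as in RestartQ-UCB, which resets
# everything to the optimistic initialization) and a partial restart,
# which keeps part of what was learned. Illustrative sketch only.

def complete_restart(Q, q_init):
    """Forget everything: reset every entry to its optimistic initial value."""
    Q[:] = q_init
    return Q

def partial_restart(Q, q_init, keep=0.5):
    """Retain a fraction `keep` of the learned values, blending the
    remainder back toward the optimistic initialization."""
    Q[:] = keep * Q + (1.0 - keep) * q_init
    return Q

# Example: 4 states x 2 actions, optimistically initialized at 10.0.
Q = np.full((4, 2), 10.0)
Q -= np.random.rand(4, 2)              # pretend some learning happened
partial_restart(Q, q_init=10.0, keep=0.5)
```

An adaptive or selective restart would additionally decide *when* (e.g., when a drift statistic exceeds a threshold) or *which entries* to reset, rather than following a fixed schedule.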